Abstract

The  asymptotic  distribution  of  a  U-statistic  is  found  in  the  case  when 
the  corresponding  Von  Mises  functional  is  stationary  of  order  1.  Practical 
methods  for  the  tabulation  of  the  limit  distributions  are  discussed,  and  the 
results extended to certain incomplete U-statistics.

§1. Introduction.

Let $X_1,\dots,X_n$ be independent random $k$-vectors with common distribution function $F$. Let $\phi(x_1,\dots,x_m)$ be a function on $\mathbb{R}^{km}$, symmetric in the $k$-vectors $x_1,\dots,x_m$, and let

$$\theta(F) = \int\cdots\int_{\mathbb{R}^{km}} \phi(x_1,\dots,x_m) \prod_{i=1}^{m} F(dx_i) \tag{1}$$

be a functional defined on the space of d.f.s. on $\mathbb{R}^k$, assuming that the integral exists. An unbiased estimate of $\theta = \theta(F)$ is furnished by the U-statistic

$$U_n = \binom{n}{m}^{-1} \sum \phi(X_{i_1},\dots,X_{i_m}) \tag{2}$$


where the sum extends over all $\binom{n}{m}$ subsets of the $X_i$. Following Hoeffding [10], define

$$\phi_c(x_1,\dots,x_c) = E[\phi(x_1,\dots,x_c,X_{c+1},\dots,X_m)], \quad c = 1,\dots,m,$$

and $\Psi_c(x_1,\dots,x_c) = \phi_c(x_1,\dots,x_c) - \theta$, assuming all expectations exist. Also define

$$\zeta_0 = 0, \quad \zeta_c = \mathrm{Var}(\phi_c(X_1,\dots,X_c)), \quad c = 1,2,\dots,m.$$

If for some nonnegative integer $d < m$, $\zeta_{d+1} \neq 0$ but $\zeta_c = 0$ for $c = 0,\dots,d$, the functional (1) is said to be stationary of order $d$ at $F$. If $d = 0$, then (2) has an asymptotically normal distribution after suitable normalization [10].

If $d \neq 0$, the distribution is no longer normal and may be obtained by applying the theory of differentiable statistical functionals due to Von Mises and Filippova ([13], [7]).

Let $F_n$ be the empirical distribution function of $X_1,\dots,X_n$. The distribution of

$$\theta(F_n) = n^{-m} \sum_{i_1=1}^{n} \cdots \sum_{i_m=1}^{n} \phi(X_{i_1},\dots,X_{i_m}) \tag{3}$$

is closely related to the distribution of $U_n$ as described in §2, and the asymptotic distribution of $\theta(F_n)$ is given by the following theorem.


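Theorem (Von Mises, Filippova). Let $\theta$ be a stationary functional of order $d$. Then the asymptotic distribution of $n^{(d+1)/2}(\theta(F_n) - \theta)$ is identical to that of

$$n^{(d+1)/2} \binom{m}{d+1} \int\cdots\int \Psi_{d+1}(x_1,\dots,x_{d+1}) \prod_{i=1}^{d+1} \bigl(F_n(dx_i) - F(dx_i)\bigr). \tag{4}$$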


In  the  case  d  =  1,  the  asymptotic  distribution  of  (4)  can  be  found 
using  integral  equation  techniques.  Filippova  gives  an  expression  for  the 
characteristic  function  of  the  asymptotic  distribution  of  (4)  in  terms  of 
Fredholm  determinants.  Gregory  [8]  gives  a  more  concrete  representation  of 
the  asymptotic  distribution  of  (4)  as  an  infinite  series  of  random  variables, 
in  the  case  m  =  2.  Below  we  combine  these  results  to  give  explicit 
expressions  for  the  c.f.s.  of  the  limit  distributions  and  discuss  practical 
methods  for  the  tabulation  of  the  limit  distributions.  We  also  make  some 
remarks  about  incomplete  U-statistics. 


§2. Relationship between $\theta(F_n)$ and $U_n$.


Let $I$ be the set of indices $\{(i_1,\dots,i_m) : 1 \le i_l \le n,\ l = 1,\dots,m\}$ and $I_j$ be the subset of $I$ consisting of those $m$-tuples with exactly $j$ distinct integers. Define for each $j = 1,\dots,m$ a symmetric kernel $\phi_{(j)}(x_1,\dots,x_j)$ by

$$j!\,\phi_{(j)}(x_1,\dots,x_j) = \sum \phi(x_{i_1},\dots,x_{i_m}),$$

where the sum is taken over all $m$-tuples of indices $(i_1,\dots,i_m)$ with $1 \le i_l \le j$ such that exactly $j$ of the $i_l$ are distinct. Then

$$\begin{aligned}
\theta(F_n) &= n^{-m} \sum_{I} \phi(X_{i_1},\dots,X_{i_m}) \\
&= n^{-m} \sum_{j=1}^{m} \sum_{I_j} \phi(X_{i_1},\dots,X_{i_m}) \\
&= n^{-m} \sum_{j=1}^{m} \ \sum_{1 \le i_1 < \cdots < i_j \le n} j!\,\phi_{(j)}(X_{i_1},\dots,X_{i_j}) \\
&= n^{-m} \sum_{j=1}^{m} j! \binom{n}{j} U_n^{(j)},
\end{aligned}$$

where $U_n^{(j)}$ is the U-statistic corresponding to the kernel $\phi_{(j)}$. Note that $U_n^{(m)} = U_n$. Thus

$$n(\theta(F_n) - \theta) = n(U_n - \theta) + Z_n \tag{5}$$

where

$$Z_n = nU_n \left[ \frac{n!}{(n-m)!\,n^{m}} - 1 \right] + n \sum_{j=1}^{m-1} \frac{n!}{(n-j)!\,n^{m}}\, U_n^{(j)}. \tag{6}$$
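The decomposition (5)-(6) can be checked numerically in the simplest case $m = 2$, where $\phi_{(1)}(x) = \phi(x,x)$. The following is a minimal sketch under that assumption; the function names and the product kernel used in the usage note are illustrative choices, not taken from the references.

```python
import itertools
import math
import random

def u_statistic(xs, kernel):
    """Complete U-statistic (2) for a symmetric kernel of degree m = 2."""
    pairs = list(itertools.combinations(xs, 2))
    return sum(kernel(a, b) for a, b in pairs) / len(pairs)

def v_statistic(xs, kernel):
    """theta(F_n) of (3) for m = 2: average over all n^2 ordered pairs."""
    n = len(xs)
    return sum(kernel(a, b) for a in xs for b in xs) / n**2

def z_n(xs, kernel):
    """Z_n of (6) specialised to m = 2, using phi_(1)(x) = kernel(x, x)."""
    n = len(xs)
    u = u_statistic(xs, kernel)
    u1 = sum(kernel(a, a) for a in xs) / n                        # U_n^(1)
    coef_m = math.factorial(n) / (math.factorial(n - 2) * n**2)   # n!/((n-m)! n^m)
    coef_1 = math.factorial(n) / (math.factorial(n - 1) * n**2)   # n!/((n-j)! n^m), j = 1
    return n * u * (coef_m - 1.0) + n * coef_1 * u1

if __name__ == "__main__":
    rng = random.Random(0)
    sample = [rng.gauss(0.0, 1.0) for _ in range(25)]
    prod = lambda a, b: a * b          # illustrative kernel with theta = 0
    n = len(sample)
    # Both sides of (5); theta cancels, so it is omitted.
    print(n * v_statistic(sample, prod),
          n * u_statistic(sample, prod) + z_n(sample, prod))
```

For $m = 2$ the coefficients reduce to $(n-1)/n$ and $1/n$, so $Z_n = -U_n + n^{-1}\sum_i \phi(X_i,X_i)$, which is what the two printed numbers confirm up to rounding.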


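If $\theta$ is stationary of order 1, then $n(\theta(F_n) - \theta)$ converges to a nondegenerate distribution as seen in §3, and $\mathrm{Var}(nU_n)$ is bounded as $n \to \infty$ ([10]). Moreover, $E(Z_n)$ converges to the quantity $-\frac{m(m-1)}{2}\theta + \theta^{(m-1)}$, where $\theta^{(m-1)} = E[\phi_{(m-1)}(X_1,\dots,X_{m-1})]$. Since $\mathrm{Var}\,U_n = O(n^{-2})$ and $\mathrm{Var}\,U_n^{(j)} = O(n^{-1})$ for $j < m$, it follows from (6) that $Z_n$ converges in probability to $-\frac{m(m-1)}{2}\theta + \theta^{(m-1)}$, and so the asymptotic distributions of $n(\theta(F_n) - \theta)$ and $n(U_n - \theta)$ differ only by a shift. A more explicit expression for $\theta^{(m-1)}$ is

$$\begin{aligned}
\theta^{(m-1)} &= E[\phi_{(m-1)}(X_1,\dots,X_{m-1})] \\
&= \frac{1}{(m-1)!} \sum E(\phi(X_{i_1},\dots,X_{i_m})) \\
&= \frac{1}{(m-1)!}\,\frac{m!\,(m-1)}{2!}\, E(\phi(X_1,X_1,X_2,\dots,X_{m-1})) \\
&= \binom{m}{2} E(\phi_2(X_1,X_1)),
\end{aligned}$$

where the sum in the second line is over the $m$-tuples of indices from $\{1,\dots,m-1\}$ with exactly $m-1$ distinct entries, and so we may write $\lim_n E(Z_n) = \lim_n E[n(\theta(F_n) - \theta)] = \binom{m}{2} E(\Psi_2(X_1,X_1))$.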


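§3. Asymptotic distribution of $n(\theta(F_n) - \theta)$ and $n(U_n - \theta)$.

By the Von Mises-Filippova theorem, the asymptotic distribution of $n(\theta(F_n) - \theta)$ coincides with that of

$$\binom{m}{2} \int\!\!\int \Psi_2(x_1,x_2)\, G_n(dx_1)\, G_n(dx_2) \tag{7}$$

where $G_n(x) = \sqrt{n}\,(F_n(x) - F(x))$. Consider the kernel $\tilde\Psi_2(x_1,x_2)$ where (all integrals are over $\mathbb{R}^k$)

$$\tilde\Psi_2(x_1,x_2) = \Psi_2(x_1,x_2) - \int \Psi_2(x_1,v)F(dv) - \int \Psi_2(u,x_2)F(du) + \int\!\!\int \Psi_2(u,v)F(du)F(dv).$$

Then it is easily seen that

$$\int\!\!\int \tilde\Psi_2(x_1,x_2)\, G_n(dx_1)\, G_n(dx_2) = \int\!\!\int \Psi_2(x_1,x_2)\, G_n(dx_1)\, G_n(dx_2) = \frac{1}{n} \sum_{i=1}^{n} \sum_{j=1}^{n} \tilde\Psi_2(X_i,X_j). \tag{8}$$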


Now let $\lambda_j$, $j \ge 1$, be the (real) eigenvalues of the linear operator $L_2(\mathbb{R}^k, dF) \to L_2(\mathbb{R}^k, dF)$ with (symmetric) kernel $\tilde\Psi_2$, the centered version of $\Psi_2$ defined above. We note that by the assumption that $\zeta_2$ exists, $\int\!\!\int |\tilde\Psi_2(u,v)|^2 F(du)F(dv) < \infty$. By the results of [8], we can see that

(i) the asymptotic distribution of (8) is the same as that of

$$\sum_{j=1}^{\infty} \lambda_j (Y_j - 1) + E(\Psi_2(X_1,X_1)) \tag{9}$$

where the $Y_j$ are independent $\chi^2_1$ random variables; and

(ii) if $\sum_{j=1}^{\infty} |\lambda_j| < \infty$ then the asymptotic distribution of (8) is that of $\sum_{j=1}^{\infty} \lambda_j Y_j$.

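To prove these, set the functions $h_n$ in Theorem 2.1 of [8] to be 0; then $\frac{1}{n} \sum_{i \ne j} \tilde\Psi_2(X_i,X_j)$ converges in distribution to $\sum_{j=1}^{\infty} \lambda_j (Y_j - 1)$. Since

$$\frac{1}{n} \sum_{i} \sum_{j} \tilde\Psi_2(X_i,X_j) = \frac{1}{n} \sum_{i \ne j} \tilde\Psi_2(X_i,X_j) + \frac{1}{n} \sum_{i} \tilde\Psi_2(X_i,X_i)$$

and $\frac{1}{n} \sum_{i} \tilde\Psi_2(X_i,X_i)$ converges in probability to $E[\tilde\Psi_2(X_1,X_1)]$ by the WLLN, (9) follows upon noting that

$$\begin{aligned}
E[\tilde\Psi_2(X_1,X_1)] &= \int \Psi_2(x_1,x_1)F(dx_1) - E(\Psi_2(X_1,X_2)) \\
&= \int \Psi_2(x_1,x_1)F(dx_1) \\
&= E[\Psi_2(X_1,X_1)],
\end{aligned}$$

since $E[\Psi_2(X_1,X_2)] = 0$ by [10]. (ii) follows as in 1.2.3 of [8].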

Using (i) and (ii) above and (5) we see that the asymptotic distribution of $n(U_n - \theta)$ is that of $\binom{m}{2} \sum_{j=1}^{\infty} \lambda_j (Y_j - 1)$, and if $\sum_{j=1}^{\infty} |\lambda_j| < \infty$, this distribution is that of $\binom{m}{2} \bigl\{ \sum_{j=1}^{\infty} \lambda_j Y_j - E[\Psi_2(X_1,X_1)] \bigr\}$. The characteristic functions of the limit distributions are thus seen to be

$$\phi(t) = \prod_{j=1}^{\infty} e^{-it\lambda_j'} (1 - 2i\lambda_j' t)^{-1/2}$$

in the former case and

$$\phi(t) = e^{-it\mu} \prod_{j=1}^{\infty} (1 - 2i\lambda_j' t)^{-1/2}$$

in the latter, where $\mu = \binom{m}{2} E[\Psi_2(X_1,X_1)]$ and $\lambda_j' = \binom{m}{2} \lambda_j$, $j \ge 1$.
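As an illustration (not taken from the references), the limit law can be checked on the simplest degenerate kernel, $\phi(x_1,x_2) = x_1 x_2$ with $m = 2$ and $k = 1$, under the assumption that $X_1$ has mean 0 and finite variance $\sigma^2 > 0$. The notation is that of §1 and §3; the symbol $Z$ for a standard normal variable is introduced only for this example.

```latex
\begin{align*}
\theta &= E[X_1X_2] = 0, \qquad \phi_1(x) = x\,E[X_2] = 0
   \;\Longrightarrow\; \zeta_1 = 0, \\
\Psi_2(x_1,x_2) &= x_1x_2, \qquad \zeta_2 = \mathrm{Var}(X_1X_2) = \sigma^4 > 0, \\
\int x\,y\,\frac{y}{\sigma}\,F(dy) &= \sigma^2\,\frac{x}{\sigma}
   \;\Longrightarrow\; \lambda_1 = \sigma^2,\ \ \lambda_j = 0 \ (j \ge 2), \\
nU_n &= \frac{n}{n-1}\left[\Bigl(n^{-1/2}\sum_{i=1}^n X_i\Bigr)^2
        - \frac{1}{n}\sum_{i=1}^n X_i^2\right]
   \;\xrightarrow{\;d\;}\; \sigma^2 (Z^2 - 1), \\
\phi(t) &= e^{-it\sigma^2}\,(1 - 2i\sigma^2 t)^{-1/2}.
\end{align*}
```

The functional is thus stationary of order 1, the single eigenvalue gives the limit $\binom{2}{2}\lambda_1(Y_1 - 1)$ with $Y_1 = Z^2$, and the direct computation in the fourth line (central limit theorem for the first term, law of large numbers for the second) agrees with it.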


§4. Tabulation of the limit distribution.

In this section we discuss practical means of computing the limit distribution of $n(U_n - \theta)$. We first consider the case when the eigenvalues $\lambda_j$ are summable. The method employed to tabulate the limit distribution will depend on the amount of information we possess about the eigenvalues. There are three subcases.

(i) The eigenvalues must be determined numerically, and we have accurate determinations of $\lambda_j$ for $1 \le j \le N$.

(ii)  The  eigenvalues  can  be  explicitly  determined  for  all  j. 


(iii) The characteristic function of $\sum_{j=1}^{\infty} \lambda_j Y_j$, namely $\phi(t) = \prod_{j=1}^{\infty} (1 - 2it\lambda_j)^{-1/2}$, can be expressed in closed form.

In subcase (iii), the limit distribution may most conveniently be found by numerical inversion of $\phi(t)$. Various methods are available (e.g. Davies [5], Bohman [3], Martynov [12]). In subcase (ii), computation of the infinite product is necessary. A reasonable way to accomplish this is described in [2] and consists of expressing $\phi(t) = \prod_{j=1}^{N-1} (1 - 2it\lambda_j)^{-1/2}\, \phi_N(t)$ where $\log \phi_N(t) = -\frac{1}{2} \sum_{j=N}^{\infty} \log(1 - 2it\lambda_j)$. Formally, we have

$$\log \phi_N(t) = \frac{1}{2} \sum_{j=N}^{\infty} \sum_{k=1}^{\infty} (2it\lambda_j)^k / k = \frac{1}{2} \sum_{k=1}^{\infty} C_{N,k} (2it)^k / k, \quad \text{where } C_{N,k} = \sum_{j=N}^{\infty} \lambda_j^k.$$

The formal manipulations above are valid if the last power series converges. Since $|C_{N,k}| \le \bigl( \sum_{j=N}^{\infty} |\lambda_j| \bigr)^k = C_N^k$, say, this convergence will take place if $|t| < (2C_N)^{-1}$. The numerical inversion methods above will require the computation of $\phi$ at a finite number $M$ of ordinates $t_m$, so choose $N$ so large that $C_N < (2 \max_m |t_m|)^{-1}$ and use the power series to compute $\phi_N(t)$.
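The power-series evaluation of $\log \phi_N(t)$ can be sketched as follows. This is only an illustration under stated assumptions: the infinite tail $\lambda_N, \lambda_{N+1}, \dots$ is represented by a finite list, the number of series terms is fixed in advance, and the function names are hypothetical rather than taken from [2].

```python
import cmath

def log_phi_tail_series(t, tail_eigs, n_terms=60):
    """Power series (1/2) * sum_k C_{N,k} (2it)^k / k for log phi_N(t).

    tail_eigs stands in for lambda_N, lambda_{N+1}, ... (a finite list here,
    which is an assumption of this sketch).
    """
    c_n = sum(abs(lam) for lam in tail_eigs)        # C_N = sum_{j>=N} |lambda_j|
    if c_n == 0.0:
        return 0j
    if abs(t) >= 1.0 / (2.0 * c_n):
        raise ValueError("convergence is only guaranteed for |t| < 1/(2 C_N)")
    total = 0j
    for k in range(1, n_terms + 1):
        c_nk = sum(lam**k for lam in tail_eigs)     # C_{N,k}
        total += c_nk * (2j * t) ** k / k
    return 0.5 * total

def log_phi_tail_direct(t, tail_eigs):
    """Direct evaluation of -(1/2) * sum_j log(1 - 2 i t lambda_j)."""
    return -0.5 * sum(cmath.log(1 - 2j * t * lam) for lam in tail_eigs)

if __name__ == "__main__":
    tail = [0.5**j for j in range(5, 40)]           # illustrative geometric tail
    print(log_phi_tail_series(3.0, tail), log_phi_tail_direct(3.0, tail))
```

In practice the $C_{N,k}$ would be computed once and reused for every ordinate $t_m$; the sketch recomputes them for clarity.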

In special cases other methods may be used. If the eigenvalues are all positive, the asymptotic results of Zolotarev [14] and Hoeffding [11] furnish good approximations for the tail probabilities. If the eigenvalues are all positive and of multiplicity unity, then Smirnov's formula may be used as an efficient numerical inversion method; see Martynov [12] for details.

In subcase (i) two possible approaches suggest themselves. The use of the distribution of $\sum_{j=1}^{N} \lambda_j Y_j$ to approximate that of $\sum_{j=1}^{\infty} \lambda_j Y_j$ will not be satisfactory in general for reasonable values of $N$. For a discussion of this point and bounds on the truncation error see [2]. Rather, some method of approximating the tail of the series is required. If all the $\lambda_j$ for $j > N$ are positive, we may approximate (see [6]) the distribution of $\sum_{j=1}^{\infty} \lambda_j Y_j$ by that of $\sum_{j=1}^{N} \lambda_j Y_j + cY$, where $Y$ is a $\chi^2_\nu$ variate and $c$ and $\nu$ are chosen to make the mean and variance of the two distributions coincide.


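Thus $c$ and $\nu$ are obtained from

$$\sum_{j=1}^{N} \lambda_j + c\nu = E[\Psi_2(X_1,X_1)], \qquad \sum_{j=1}^{N} \lambda_j^2 + c^2\nu = \mathrm{Var}(\Psi_2(X_1,X_2)) = \zeta_2,$$

yielding

$$c = \frac{\zeta_2 - \sum_{j=1}^{N} \lambda_j^2}{E[\Psi_2(X_1,X_1)] - \sum_{j=1}^{N} \lambda_j}, \qquad \nu = \frac{1}{c} \Bigl( E[\Psi_2(X_1,X_1)] - \sum_{j=1}^{N} \lambda_j \Bigr).$$

The c.f. of the approximating r.v. is

$$\phi(t) = \prod_{j=1}^{N} (1 - 2i\lambda_j t)^{-1/2}\, (1 - 2ict)^{-\nu/2} \tag{10}$$

and a numerical inversion of (10) furnishes the desired approximation.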
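The moment matching and the inversion of (10) can be sketched as follows. The inversion uses the Gil-Pelaez formula with a plain midpoint rule, which is a simple stand-in chosen for this illustration and not one of the methods of [5], [3] or [12]; the function names, the truncation point `t_max` and the step count are likewise assumptions.

```python
import cmath
import math

def match_tail(eigs, trace, zeta2):
    """Solve sum(eigs) + c*nu = trace and sum(eigs^2) + c^2*nu = zeta2.

    trace stands for E[Psi_2(X_1, X_1)]; eigs are lambda_1, ..., lambda_N.
    """
    rem_mean = trace - sum(eigs)                    # equals c * nu
    rem_sq = zeta2 - sum(lam**2 for lam in eigs)    # equals c^2 * nu
    c = rem_sq / rem_mean
    nu = rem_mean / c
    return c, nu

def cf_approx(t, eigs, c, nu):
    """Characteristic function (10) of sum_j lambda_j Y_j + c Y, Y ~ chi^2_nu."""
    value = (1 - 2j * c * t) ** (-nu / 2)
    for lam in eigs:
        value *= (1 - 2j * lam * t) ** (-0.5)
    return value

def cdf_from_cf(x, cf, t_max=100.0, steps=50000):
    """Gil-Pelaez: F(x) = 1/2 - (1/pi) * int_0^inf Im(e^{-itx} cf(t)) / t dt,
    approximated by a midpoint rule on (0, t_max)."""
    h = t_max / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        total += (cmath.exp(-1j * t * x) * cf(t)).imag / t
    return 0.5 - total * h / math.pi

if __name__ == "__main__":
    # Illustrative eigenvalues 1/(pi j)^2, whose sums are 1/6 and 1/90.
    eigs = [1.0 / (math.pi * j) ** 2 for j in range(1, 11)]
    c, nu = match_tail(eigs, 1.0 / 6.0, 1.0 / 90.0)
    print(c, nu, cdf_from_cf(0.3, lambda t: cf_approx(t, eigs, c, nu)))
```

The truncation at `t_max` is only adequate when the c.f. decays reasonably fast, i.e. when several eigenvalues of comparable size are present; with one dominant eigenvalue a more careful inversion scheme would be needed.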

If it is not the case that all but a finite number of the $\lambda_j$ are of the same sign, one may compute the density of $\sum_{j=1}^{N-1} \lambda_j Y_j$ by e.g. the integral equation method of Grenander et al. [9] and approximate the distribution function of the remainder $\sum_{j=N}^{\infty} \lambda_j Y_j$ by means of a Cornish-Fisher expansion. The cumulants of the remainder are easily seen to be

$$\kappa_r = 2^{r-1} (r-1)! \sum_{j=N}^{\infty} \lambda_j^r,$$

so $\kappa_1 = E[\Psi_2(X_1,X_1)] - \sum_{j=1}^{N-1} \lambda_j$, $\kappa_2 = 2\bigl(\zeta_2 - \sum_{j=1}^{N-1} \lambda_j^2\bigr)$, and for $r \ge 3$ the $\kappa_r$ may be approximated reasonably well by computing a few more eigenvalues. The limit df may then be obtained by a numerical convolution.
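The cumulants of a weighted sum of independent $\chi^2_1$ variates, and a four-cumulant Cornish-Fisher quantile approximation built from them, can be sketched as follows. The quantile form shown is the standard textbook expansion and is an assumption about which variant is intended; the function names are hypothetical.

```python
import math

def remainder_cumulant(r, tail_eigs):
    """kappa_r = 2^(r-1) (r-1)! sum_j lambda_j^r for sum_j lambda_j Y_j,
    with the Y_j independent chi^2_1 variates."""
    return 2 ** (r - 1) * math.factorial(r - 1) * sum(lam**r for lam in tail_eigs)

def cornish_fisher_quantile(z, k1, k2, k3, k4):
    """Approximate quantile from the first four cumulants; z is the standard
    normal quantile at the desired probability level."""
    sigma = math.sqrt(k2)
    g1 = k3 / k2**1.5                      # skewness
    g2 = k4 / k2**2                        # excess kurtosis
    w = (z
         + g1 * (z**2 - 1) / 6
         + g2 * (z**3 - 3 * z) / 24
         - g1**2 * (2 * z**3 - 5 * z) / 36)
    return k1 + sigma * w

if __name__ == "__main__":
    tail = [0.5**j for j in range(5, 40)]  # illustrative tail eigenvalues
    ks = [remainder_cumulant(r, tail) for r in (1, 2, 3, 4)]
    print(cornish_fisher_quantile(1.6449, *ks))   # rough 95% point of the remainder
```

When $\kappa_1$ and $\kappa_2$ are available from $E[\Psi_2(X_1,X_1)]$ and $\zeta_2$ as described above, those values would replace the first two entries of `ks`, with only $\kappa_3$ and $\kappa_4$ estimated from extra eigenvalues.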


If all but a finite number of eigenvalues are positive (or negative) then the kernel can be expressed as the sum of a degenerate and a positive definite (negative definite) kernel and hence its eigenvalues will be summable. Thus in the nonsummable case, an infinite number of both positive and negative eigenvalues will be encountered. If only a finite number are known, we may employ the Cornish-Fisher method suggested above. If all are known, we may compute $\phi_N(t) = \prod_{j=N}^{\infty} e^{-it\lambda_j} (1 - 2i\lambda_j t)^{-1/2}$ by $\log \phi_N(t) = \frac{1}{2} \sum_{k=2}^{\infty} C_{N,k} (2it)^k / k$, where $C_{N,k} = \sum_{j=N}^{\infty} \lambda_j^k$, and proceed as before.


§5. Some remarks on incomplete U-statistics.

The calculation of the U-statistic (2) requires the averaging of $\binom{n}{m}$ terms, which may not be practical if $m$ and $n$ are not small. To reduce the volume of computation Blom [1] and Brown and Kildea [4] have proposed the use of incomplete U-statistics of the form

$$U = \frac{1}{N} \sum \phi(X_{i_1},\dots,X_{i_m}) \tag{11}$$

where the sum in (11) is taken over $N$ specified or randomly selected $m$-subsets of the indices. The asymptotic distributions of nondegenerate incomplete U-statistics ($\zeta_1 > 0$) are studied in the references above. For the degenerate case $\zeta_1 = 0$, $\zeta_2 > 0$ some of their results remain true while others need modification.

Denote by $\phi_i$ the r.v. $\phi(X_{i_1},\dots,X_{i_m})$ where $(i_1,\dots,i_m)$ is the $i$-th subset of indices in the summation (11). As in [1], let $p_c$ denote the proportion of the $N^2$ pairs $(\phi_i,\phi_j)$ having $c$ indices in common. $p_c$ will be a fixed constant or an r.v. depending on the method of subset selection. From [1] we have

$$\mathrm{Var}\,U = \sum_{c=2}^{m} E(p_c)\,\zeta_c$$

and the assumption is made that $n\,\mathrm{Var}\,U \to \delta$ where $\delta$ is some nonnegative constant. Note that the $E(p_c)$ do not depend on $\phi$ but only on the method of subset selection.

The asymptotic distribution of $U$ depends on the ratio $n/N$. If $n/N \to \infty$ as $n, N$ both $\to \infty$, the quantities $\phi_i$ are asymptotically independent, and the degeneracy of the complete U-statistic is irrelevant to the limit distribution of $\sqrt{N}(U - \theta)$, which will be normal with mean zero and variance $\zeta_m$. On the other hand, if $n/N \to 0$, the limit distribution of $U$ will coincide with that of the complete U-statistic under certain conditions.

For example, from [1] we have (denoting the complete U-statistic by $U_0$)

$$\mathrm{Var}\,U - \mathrm{Var}\,U_0 = E(U - U_0)^2$$

so $n(U_0 - \theta)$ and $n(U - \theta)$ will have the same asymptotic distribution if $n^2(\mathrm{Var}\,U - \mathrm{Var}\,U_0)$ converges to zero. Now $n^2\,\mathrm{Var}\,U_0$ converges to $2\binom{m}{2}^2 \zeta_2$, so a sufficient condition for the coincidence of the asymptotic distributions is

$$\lim_{n \to \infty} n^2 E(p_c) = \begin{cases} 2\binom{m}{2}^2, & c = 2, \\ 0, & c > 2. \end{cases}$$


Alternatively, if the $N$ subsets are chosen at random, with replacement, from the $\binom{n}{m}$ possible subsets, then ([1])

$$\mathrm{Var}\,U - \mathrm{Var}\,U_0 = \frac{1}{N} (\zeta_m - \mathrm{Var}\,U_0)$$

so the asymptotic distributions coincide if $n^2/N$ converges to zero. Thus the incomplete U-statistic based on $n^{2+\varepsilon}$ randomly chosen subsets will have the same asymptotic distribution as that of the complete U-statistic based on $\binom{n}{m}$ subsets.
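The random-selection scheme just described can be sketched as follows: each of the $N$ subsets is drawn independently and uniformly from all $m$-subsets, so the same subset may occur more than once. The function names and the additive kernel in the usage note are illustrative assumptions, not taken from [1] or [4].

```python
import itertools
import random

def complete_u(xs, kernel, m):
    """Complete U-statistic (2): average of the kernel over all m-subsets."""
    subsets = list(itertools.combinations(xs, m))
    return sum(kernel(*s) for s in subsets) / len(subsets)

def incomplete_u(xs, kernel, m, n_subsets, rng):
    """Incomplete U-statistic (11): average of the kernel over n_subsets
    m-subsets drawn at random, with replacement, from all C(n, m) subsets.
    xs must be a sequence (e.g. a list); the kernel is assumed symmetric."""
    total = 0.0
    for _ in range(n_subsets):
        # rng.sample picks m distinct positions, i.e. one uniformly chosen m-subset.
        total += kernel(*rng.sample(xs, m))
    return total / n_subsets

if __name__ == "__main__":
    data = [float(i) for i in range(10)]
    add = lambda a, b: a + b               # illustrative (nondegenerate) kernel
    print(complete_u(data, add, 2),
          incomplete_u(data, add, 2, 20000, random.Random(1)))
```

The cost is $N$ kernel evaluations instead of $\binom{n}{m}$, which is the saving that motivates the choice $N = n^{2+\varepsilon}$ in the degenerate case.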